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Abstract 

In this paper, we propose a novel class of Nash problems for Cognitive Radio (CR) networks composed 
of multiple primary users (PUs) and secondary users (SUs) wherein each SU (player) competes against 
the others to maximize his own opportunistic throughput by choosing jointly the sensing duration, the 
detection thresholds, and the vector power allocation over a multichannel link. In addition to power 
budget constraints, several (deterministic or probabilistic) interference constraints can be accommo- 
dated in the proposed general formulation, such as constraints on the maximum individual/aggregate 
(probabilistic) interference tolerable from the PUs. To keep the optimization as decentralized as possi- 
ble, global interference constraints, when present, are imposed via pricing; the prices are thus additional 
variables to be optimized. The resulting players' optimization problems are nonconvex and there are 
price clearance conditions associated with the nonconvex global interference constraints to be satisfied 
by the equilibria of the game, which make the analysis of the proposed game a challenging task; none of 
classical results in the game theory literature can be successfully applied. To deal with the nonconvex- 
ity of the game, we introduce a relaxed equilibrium concept— the Quasi-Nash Equilibrium (QNE)— and 
study its main properties, performance, and connection with local Nash equilibria. Quite interestingly, 
the proposed game theoretical formulations yield a considerable performance improvement with respect 
to current centralized and decentralized designs of CR systems, which validates the concept of QNE. 

1 Introduction and Motivation 

In the last decade, Cognitive Radio (CR) has received considerable attention as a way to improve the 
efficiency of radio networks [TJ [2]. The CR networks adopt a hierarchical access structure where the 
primary users (PUs) are the legacy spectrum holders while the secondary users (SUs) are the unlicensed 
users who sense the electromagnetic environment and adapt their transceiver parameters as well as the 
resource allocation decisions in order to dynamically access temporally unoccupied spectrum regions. 

The challenge for a reliable sensing algorithm is to identify suitable transmission opportunities without 
compromising the integrity of the PUs. One of the design criteria is to make the probability of false alarm 
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as low as possible, since it measures the percentage of vacant spectrum which is misclassified as busy, 
increasing thus the opportunistic usage of the spectrum from the SUs. On the other hand, in order to 
limit the probability of interfering with PUs, it is desirable to keep the missed detection probability as 
low as possible. The detection thresholds are the trade-off factor between the false alarm and the missed 
detection probabilities; low thresholds will result in high false alarm rate in favor of low missed detection 
probability and vice versa. Alternatively, the choice of the sensing time offers a trade-off between the 
quality and speed of sensing: increasing the sensing times permits to reach both low false alarm and 
detection probability values, thus reducing the time available for secondary transmissions, which would 
result in low SUs' throughput. The above trade-off calls naturally for a joint determination of the sensing 
and transmission parameters of the SUs, under the paradigm of selfish behavior among these users. The 
modeling and analysis of this competitive multi-agent optimization is the main subject of this work. 

1.1 Related work 

The joint optimization of the sensing and transmission parameters has been only partially addressed in 
the literature; current works on the subject can be divided in the following three main classes. 

- Optimization of the sensing parameters only [3j SI [Tl [8] - In [3 i], the authors proposed 
alternative centralized schemes that optimize the detection thresholds for a bank of energy detectors, 
in order to maximize the so-called opportunistic throughput, while keeping the sensing time and the 
transmission parameters of the SU fixed and given a priori. The optimization of the sensing time and the 
sensing time/detection thresholds for a given missed detection probability and transmission rate of one 
SU was addressed in [5] and [TJ [8] , respectively. A throughput-sensing trade-off for a given transmission 
rate of the SU was studied in [6]. In the above papers there is no optimization of the SU's transmission 
strategies, and the proposed schemes are applicable only to CR scenarios composed by one PU and SU. 

- Optimization of the transmission parameters only [lOl HH [121 [131 [141 HU [13 [18] . These 
papers address the optimization of the SUs' transceivers in a multiuser OFDM CR scenario; several 
centralized [TOl \TT\ [TH] schemes or distributed algorithms based on a game theoretic approach |12[ [TBI O 
[To] were proposed; a recent overview of the latter approaches can be found in |15[I19|. A general framework 
based on variational inequalities was proposed in |16[ [T7| to study and solve in a distributed way convex 
noncooperative games with (possibly) side (i.e., coupling) constraints (e.g., interference constraints). In 
all the aforementioned papers the sensing process is not considered as part of the optimization; in fact 
the SUs do not perform any sensing but they are allowed to transmit over the licensed spectrum provided 
that they satisfy interference constraints imposed by the PUs, no matter if the PUs are active of not. 

- Joint optimization of some of the sensing/transmission parameters [201 ED (211 [23J [Ml [25] . In 
|20| (or |21[ I22|). the sensing time and the transmit power (or the power allocation |21[ [22] over multi- 
channel links) of one SU were jointly optimized while keeping the detection probability (and thus the 
decision threshold) fixed to a target value. In |23[I24|. the authors focused on the joint optimization of the 
power allocation and the equi-false alarm rate of one SU |23| over multi-channel links, for a fixed sensing 
time. The case of multiple SUs and one PU was considered in |24| (and more recently in |25|). under the 
same assumptions of [23]; however no formal analysis of the proposed formulation was provided. Moreover, 
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these papers |23[ [2-H [25] did not consider the sensing overhead as part of the optimization, leaving unclear 
how to optimally choose the sensing time. 

1.2 Main contributions 

What emerges from the analysis of existing literature is that, up to now, state-of-the-art approaches 
have focused on the optimization of specific features and single components of a CR system in isolation. 
In this paper we move a step ahead and propose a novel class of Nash Equilibrium Problems (NEPs), 
suitable for designing multiple primary/secondary user CR networks, under different practical settings. 
Aiming to dynamically access temporally unoccupied primary spectrum regions, in our formulation, the 
SUs maximize their own opportunistic throughput by jointly optimizing the sensing time, the decision 
thresholds of a bank of energy detectors, and the power allocation over multi-channel links. Because of 
sensing errors, it may happen however that the SUs attempt to access part of licensed spectrum that 
is still occupied by active PUs, causing thus harmful interference. This motivates the introduction of 
probabilistic (rather than deterministic) interference constraints that are imposed to control the power 
radiated over the licensed spectrum only when a missed detection event occurs (in a probabilistic sense). 
The proposed formulation accommodates alternative combinations of power/interference constraints. For 
instance, in addition to classical (deterministic) transmit power (and possibly spectral masks) constraints, 
we envisage the use of (probabilistic) average individual (i.e., on each SU) and/or global (i.e., over all 
the SUs) interference tolerable by the primary receivers. Local interference constraints fit naturally to 
scenarios where the SUs are not willing to cooperate; whereas the global ones, which are less conservative, 
are more suitable for settings where SUs may want to trade some signaling for better performance. By 
imposing a coupling among the power allocations of the SUs, global interference constraints introduce 
a new challenge in the system design: how to enforce global interference constraints without requiring 
a centralized optimization? A major contribution of the paper is to address this issue by introducing a 
pricing mechanism in the game, through a penalization in the players' objective functions. The prices will 
be then additional variables to be optimized, while guaranteeing the global interference constraints to be 
satisfied at any solution of the game. 

The resulting game-theoretical formulations belong to the class of nonconvex games, where the non- 
convexity occurs at both the objective functions and feasible sets of the SUs' optimization problems. 
Indeed the objective functions of the SUs (to be maximized) are not quasi-concave and the local/global 
interference constraints are bi-convex and thus nonconvex; moreover, in the presence of global interference 
constraints, there are possibly unbounded price variables to be optimized and pricing clearing conditions 
associated with these constraints to be satisfied at the equilibrium. All this makes the analysis of the 
proposed games a challenging task; none of previous results in the game theory literature can be success- 
fully applied (see Sec. [J] for more details); without (quasi-) concavity of the players' payoff functions or 
convexity of the strategy sets, a solution of the game— the Nash Equilibrium (NE)— may not even exist 
|26| I27j . To overcome this issue, alternative refinements of the NE concept have been introduced in the 
literature, in the form of "local" solutions (see, e.g., |26 [ [28 l [29 ) I30| and references therein): in contrast to 
the NE that is resistant to "arbitrarily large" deviations of single player's strategies, local equilibria require 
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stability only against "small" unilaterally deviations of the players. Among all, we mention here some local 
solution concepts that have been proposed in the literature: such as the "critical NE" [26], the "Local NE" 
(LNE) [29], and the "generalized equilibrium" [30]. Despite their theoretical interest, those local solution 
concepts have a very limited applicability: when available, there are only abstract mathematical conditions 
granting their existence (see, e.g., |26[ 130] ). whose verification for games arising from realistic applications 
such as those presented in the present paper remains elusive. A system design based on these concepts 
may lead to an unpredictable system behavior (e.g., equilibrium existence vs. nonexistence, convergence 
of algorithms vs. nonconvergence) , which is not a desirable feature in practice. 

In this paper, we propose an alternative formalization of solution in nonconvex games; building on the 
well-established and accepted definition of stationary solutions of a nonlinear programming, we introduce 
a (relaxed) equilibrium concept for a nonconvex NEP, which is a solution of the aggregated stationarity 
conditions (the Karush-Kuhn- Tucker conditions) of the players' optimization problems. We call such a 
stationary tuple a quasi-Nash equilibrium (QNE) of the (nonconvex) game; the prefix "quasi" is intended 
to signify that a NE (if it exists) must be a QNE under mild conditions (constraint qualifications) . While 
the concept of QNE seems quite natural and conceptually similar to the idea of critical NE [26] and 
generalized equilibrium [30] . its analysis is not an easy task (cf. Sec.H]), and cannot be addressed using 
results in previous papers (e.g., |26[ [30]): for instance, in the absence of convexity or boundedness of 
the price variables that could lead to the non-existence of a NE, the existence of a QNE is by no means 
obvious. Building on our recent results in [31] ; a major contribution of the paper is to provide a satisfactory 
characterization of the QNE, in terms of existence, connection with the (L)NE, and performance. The 
main features of the QNE are the following: i) it always exists for the proposed class of nonconvex games; 
ii) it is shown to have some local optimality properties (in the sense described in Sec. [6]); and iii) the 
proposed joint optimization of sensing and transmission strategies based on the QNE yields a considerable 
performance improvements (more than one hundred per cent in "high" interference regimes) with respect 
to both centralized [18] and decentralized |12| [T4"l [15] [T7] CR designs that do not perform any joint 
optimization. These desired properties validate the use of the QNE both theoretically and practically. 

At the QNE of the game, the optimal sensing times of the SUs tend to be different, implying that 
some SUs may start transmitting while some others are still in the sensing phase, which may introduce 
a significant performance degradation in the sensing process of the other SUs. To overcome this issue, a 
further contribution of the paper is to propose a distributed approach to "force" (at least asymptotically) 
the same optimal sensing time for all the SUs; which is referred to as the equi-sensing case. 

In summary, the main contributions of the paper are the following: 

• We propose a novel class of NEPs (possibly with pricing), where for the first time a joint optimization 
of the detection threshold, the sensing time, and the transmission strategies of multiple SUs is performed, 
under power and several (novel) local and/or global probabilistic interference constraints; 

• We introduce a relaxed equilibrium concept— the QNE— and develop a novel optimization-based theory 
for studying its main properties, the connection with the LNE, and its performance; 

• We show how to modify the original NEPs to impose in a distributed way an optimal equi-sensing time 
to all the SUs, and study the main properties of the resulting game. 
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Overall, we introduce a new line of analysis based on Variational Inequalities (Vis) in the literature of 
nonconvex games (possibly) with pricing that is expected to be broadly applicable for other game models. 

Because of the space limitation, we omit here details on how to solve the proposed games, which is the 
subject of a companion paper [32] . where we propose alternative distributed algorithms along with their 
convergence properties and a detailed discussion on their practical implementation; and also establish the 
existence of NE of the games under a set of small-interference assumptions between the SUs and PUs 
(that are not needed for the QNE). 

The rest of the paper is organized as follows. Sec. [2] describes the system model, whereas Sec. [3] 
introduces the new game-theoretical formulations. The analysis of the game is addressed in Sec.Ul where 
the LNE and the concept of QNE are introduced, and their properties and connection are studied. Sec. 
[5] focuses on the equi-sensing case; whereas Sec. [6] outlines some extensions of the proposed formulations 
covering more general settings. Sec .0 provides some numerical results showing the quality of the proposed 
approach; and finally Sec. [8] draws the conclusions. 

2 System Model 

We consider a scenario composed of Q active SUs, each formed by a transmitter-receiver pair, coexisting in 
the same area and sharing the same band. We focus on block transmissions over SISO frequency-selective 
channels. It is well known that, in such a case, multicarrier transmission is capacity achieving for large 
block-length. Based on the 802. 22- Wireless- Regional- Area-Networks standard (WRAN) [33), we assume 
that the Medium Access Control (MAC) frame is divided in two slots: one sensing slot and one data slot. 
During the sensing interval, the SUs stay silent and sense the electromagnetic environment looking for the 
spectrum holes, whereas during the data slot they transmit (possibly) simultaneously over the portions 
of the licensed spectrum detected as available. The sensing and transmission phases are described in Sec. 
12.11 and Sec. 12.21 respectively, whereas the proposed joint optimization of the sensing and transmission 
strategies over the two phases is introduced in Sec. [3l 

We wish to point out that, even if we focus on ad-hoc networks and adopt the classical CR terminology 
(e.g., PUs, SUs, etc.), the proposed model and results are applicable to broader scenarios, wherever there 
are heterogeneous (small-cell) networks organized in a hierarchical or multi-tier structure and sharing 
the same spectrum (e.g., due to the spatial reuse of the frequencies). This happens for example in 
femtocell networks, where femto access points (FAPs) serve femto users, subject to some constraints on 
the interference radiated toward mobile users served by macro base stations. 

2.1 The spectrum sensing problem 

For notational simplicity, we preliminarily introduce the sensing problem in a CR scenario composed of 
one active PU and under some simplified assumptions; a more general setting (e.g., multiple active PUs) 
is addressed in Sec. [6] within the framework of robust detection. 

The spectrum sensing problem of SU q = 1, . . . , Q on subcarrier k = 1, . . . , N is formulated as a binary 
hypothesis testing, with the following two sets of hypotheses: at time index n = 1,2,..., K q , 
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(PU signal absent) U k \ : y q ,k[n] = w q>k [n] n\ 

(PU signal present) H k \x ■ y q ,k[ n ] = I q,k[ n \ + w q,k[ n ]i 
where Hk\o represents the absence of any primary signal over the subcarrier A;— the received baseband 
complex signal y q ,k\ n ] contains only additive background noise w q ^[n] — and T-L k \i represents the presence 
of (at least) one PU— the received signal contains the primary signaling /g,fcM corrupted by noise— and 
K q = [_Tq f q \ ~ T q f q is the number of samples, with r q and f q denoting the sensing time and the sampling 
frequency, respectively. 

Given the hypothesis testing problem in ([1]), we make the following standard assumptions in the 
literature of sensing algorithms [HI SI El El [TJ [HI [20l EH [23]: (Al) the noise io ?i fc[n] is drawn by a circu- 
larly symmetric complex stationary (ergodic) Gaussian process with zero mean and variance <J 2 qk \ (A2) 
the primary signaling I q ,k[ n ] are samples of a circularly symmetric complex stationary (ergodic) random 
process with zero mean, variance a\ fc , and statistically independent of the noise process; (A3) the sam- 
ple frequency f q is chosen so that the random variables (RVs) {yq,fc[?i<]}n are independent, for all q and 
k (AA) the system parameters— the spectrum occupancy status of the PUs and the primary/secondary 
(cross-) channels— change sufficiently slowly such that they can be assumed to be constant over each sens- 
ing/transmission interval; and (A5) the noise variance a 2 k and the power cr| fe of the primary signaling 
are estimate with no errors at the secondary receiver q. Note that (A5) can be relaxed by taking explicitly 
into account estimation errors in the knowledge of a 2 q k and a\ fc as well as the effect of shadowing and 
fading, as outlined in Sec. [6j see also [35]. 

Within the Neyman-Pearson framework, the test statistic of SU q over subcarrier k based on the energy 
detector maximizing the detection probability under a given false alarm rate is |36j : 

D ( Y ^) = ^EKfeNI 2 H f ^ (2) 

K <i n=l «*|o 

where ~{ qk is the decision threshold of SU q for the carrier k = 1,...,N, to be chosen to meet the 
required false alarm rate. Note that if the received samples can be assumed Gaussian distributed (and 
uncorrelated) , the above energy-based decision is optimal; otherwise it is still a valuable choice when no 
a-priori information is available on the primary signal features. Moreover, it is interesting to report some 
results in |37[ I38|. where it has been argued that for several models, if the probability density functions 
under both hypotheses are perfectly known, energy detection performs close to the optimal detector. The 
case of partial knowledge of the probability density functions is discussed in Sec. [6] within the context of 
robust detection; see also |35j . 

Invoking the central limit theorem, the random variable D (Y„ &) can be approximated for sufficiently 
large K q = r q f q by a Gaussian distribution: for i = 1, 2, D (Y q>k ) \ U k \i ~ TV (f-i q ,k\i, a l } k\i/ K q) > wnere 



a J if* = 0, 2 



oJ jfc , ift = 



1 We can relax the constraint on f q by requiring in (A3) instead block independence of the RVs {y q ,k[n]} n , and invoke for 
our derivations the central limit theorem valid for correlated (m-indepedend) RVs, as stated, e.g., in [341 Th. 2.8.1]; because 
of the space limitation, we omit these details here and stay within the original assumption (A3). 
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An explicit expression of the expected value E|/g fc| 4 above depends on the PUs' signaling features. For 
example, if the primary signals are drawn from a PSK modulation, then o~ qk ^ = cr qk ^2 cr| fe + o~ qk ^J [3]; 

whereas if the PU signaling is assumed to be Gaussian, then = fcf k ~^~ a qk) 0- 

The performance of the energy detector scheme are measured in terms of the detection probability 
P q A k(7q,ki T q ) - Prob {D(Y q , k ) > 7g ifc | %i,k} and false alarm probability P q ^ k (^ q ,k, r q ) = Prob{L»(Y 9jfc ) > 
lq,k I T~(-o,k} that, under the assumptions above, are given by 

P q % (7,,*, r g ) = Q (^J ^ ~ ~^ qM ° ) and P q % ( 7g , fc , r q ) = Q ( V^^— ^) , (3) 

where Q(x) = (l/\/2vr) e~ t2 l 2 dt is the Q-function. The interpretation of P q \ (7 g ,fc, r g ) and P qk (7 g ,fc, r g ) 
within the CR scenario is the following: 1 — P q \ signifies the probability of successfully identifying from 
the SU q a spectral hole over carrier k, whereas the missed detection probability -Pj™ lss = 1 — P q k rep- 
resents the probability of SU q failing to detect the presence of the PUs on the subchannel k and thus 
generating interference against the PUs. The free variables to optimize are the detection thresholds "fq^k's 
and the sensing times r 5 's; ideally, we would like to choose 7 9i fc's and T q s in order to minimize both P q ^ 
and -P,,™ 1SS , but ([3]) shows that there exists a trade-off between these two quantities that will affect both 
primary and secondary performance. It turns out that, 7 q,fc's and T q 's can not be chosen by focusing only 
on the detection problem (as in classical decision theory), but the optimal choice of 7^.^ and T q must be 
the result of a joint optimization of the sensing and transmission strategies over the two phases; such a 
novel optimization is formulated in Sec. [3l 

2.2 The transmission phase 

We model the set of the Q active SUs as a frequency-selective iV-parallel interference channel, where N is 
the number of available subcarriers; no interference cancellation is performed at the secondary receivers 
and the Multi-User Interference (MUI) is treated as additive colored noise at each receiver. This model is 
suitable for most of the CR scenarios, where the SUs coexisting in the network operate in a uncoordinated 
manner without cooperating with each other, and no centralized authority is assumed to handle the 
network access for the SUs. The transmission strategy of each SU q is then the power allocation vector 
p q = {p q .k}k=i over t ne N subcarriers (p q ,k is the power allocated over carrier k), subject to the following 
transmit power constraints 

P q ^L q eR N : X>,, fc <P„ 0<p 9 <p" ax |, (4) 

where we also included (possibly) local spectral mask constraints p™ 1 " = (p™^)/^ [the vector inequality 
in (j4| is component-wise]. 

Opportunistic throughput. The goal of each SU is to optimize his "opportunistic spectral utilization" 
of the licensed spectrum, having no a priori knowledge of the probabilities on the PUs' presence. According 
to this paradigm, each subcarrier k is available for the transmission of SU q if no primary signal is detected 
over that frequency band, which happens with probability 1 — P q %- This motivates the use of the aggregate 
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opportunistic throughput as a measure of the spectrum efficiency of each SU q. Given the power allocation 
profile p = (pq)S = i of the SUs, the detection thresholds -y q = (7g,jfc)fcLij and sensing time T q , the aggregate 
opportunistic throughput of SU q is defined as 

N 

R q (r„ p, lq ) = (i " S) E t 1 " P tk (7 9l fc, r,)] r g , fc (p) (5) 

fe=i 

where 1 — r g /T (r g < T) is the portion of the frame duration available for opportunistic transmissions, 
Pqlt (^q,ki T q) is the worst-case false alarm rate defined in Q, and r q> f. (p) is the maximum information rate 
achievable on secondary link q over carrier k when no primary signal is detected and the power allocation 
of the SUs is p. Under basic information theoretical assumptions, the maximum achievable rate r q ^ (p) 
for a specific power allocation profile pi^, . . . ,PQ t k is 

< \ i (m \H qq (k)\ 2 p q ,k \ , R s 

where {Hqq(k)} k=1 is the channel transfer function of the direct link q and {H rq (k)} k=l is the cross- 
channel transfer function between the secondary transmitter r and the secondary receiver q; and o~ qk is 
the variance of the background noise over carrier k at the receiver q (assumed to be Gaussian zero-mean 
distributed). 

Probabilistic interference constraints. Due to the inherent trade-off between P q \ and -P g ™ lss (cf. Sec. 
12. ip . maximizing the aggregate opportunistic throughput ([5]) of SUs will result in low P qk and thus large 
P 9 ™ 1SS , hence causing harmful interference to PUs (which happens with probability -P?)f s ). To control 
the interference radiated by the SUs, we propose to impose probabilistic interference constraints in the 
form of individual and/or global constraints. Individual interference constraints are imposed at the level of 
each SU q to limit the average interference generated at the primary receiver, whereas global interference 
constraints limit the average aggregate interference generated by all the SUs, which is in fact the average 
interference experimented by the PU. Examples of such constraints are the following. 

- Individual interference constraints: 

N 

E p ot B t*) ■ w i* ■ J*,* ^ j r x < (7) 

k=l 

- Global interference constraints, 

Q N 

E E P «T ^ r q ) ■ w q , k ■ p q , k < I max , (8) 

q=l k=l 

where I? 1 '" [or / max ] is the maximum average interference allowed to be generated by the SU q [or all the 
SU's], and Wg^s are a given set of positive weights. If an estimate of the cross-channel transfer functions 
{G q (k)} k=1 between the secondary transmitters q's and the primary receiver is available, then the natural 
choice of w qk is w qk = \G q {k)\ 2 , so that ([7]) and ([8|) become the average interference experienced at the 
primary receiver. In some scenarios where the primary receivers have a fixed geographical location, it may 
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be possible to install some monitoring devices close to each primary receiver having the functionality of 
(cross-) channel measurement and estimation of /™ ax 's. In scenarios where this option is not feasible and 
the channel state information cannot be acquired, a different choice of the weights coefficients w q kS and 
the interference threshold /™ ax in ([7]) and ([8]) can be made, based e.g. on worst-case channel/interference 
statistics; several alternative options are discussed in our companion paper [32], which is devoted to the 
design of distributed solution algorithms along with their practical implementation. 

Remark 1 (Individual vs. global constraints). The proposed individual and/or global interference 
constraints provide enough flexibility to explore the interplay between performance and signaling among 
the SUs, making thus the proposed model applicable to different CR scenarios. For instance, in the 
settings where the SUs cannot exchange any signaling, the system design based on individual interference 
constraints seems to be the most natural formulation (see Sec. I3.ip ; this indeed leads to totally distributed 
algorithms with no signaling among the SUs, as we show in the companion paper [32] , On the other hand, 
there are scenarios where the SUs may want to trade some signaling for better performance; in such 
cases imposing global interference constraints rather than the more conservative individual constraints is 
expected to provide larger SUs' throughputs. This however introduces a new challenge: how to enforce 
global interference constraints in a distributed way? By imposing a coupling among the power allocations 
of all the SUs, global interference constraints in principle would call for a centralized optimization. A 
major contribution of the paper is to propose a pricing mechanism based on the relaxation of the coupling 
interference constraints as penalty term in the SUs' objective functions (see Sec. . In the companion 
paper [32], we show that this formulation leads to (fairly) distributed best-response algorithms where the 
SUs can update the prices via consensus based-schemes, which requires a limited signaling among the SUs, 
in favor of better performance. 

3 Joint Optimization of Sensing and Transmissions via Game Theory 

We focus now on the system design and formulate the joint optimization of the sensing parameters and 
the power allocation of the SUs within the framework of game theory. Motivated by the discussion in 
the previous section (cf. Remark [1]), we propose next two classes of equilibrium problems: i) games with 
individual constraints only; and ii) games with individual and global constraints. The former formulation 
is suitable for modeling scenarios where the SUs are selfish users who are not willing to cooperate, whereas 
the latter class of games is applicable to the design of systems where the SUs can exchange limited 
signaling in favor of better performance. At the best of our knowledge, both formulations based on the 
joint optimization of sensing and transmission strategies are novel in the literature. We introduce first 
the two formulations in Sec. 13.11 and Sec. 13.21 below, and then provide a unified analysis of both games. 

3.1 Game with individual constraints 

In the proposed game, each SU is modeled as a player who aims to maximize his own opportunistic 
throughput R q (p, "f q , r 9 ) by choosing jointly a proper power allocation strategy p q = (pq,k)k=v detection 
thresholds 7 ? = (7 9 ,fc)fcLi> an d sensing time T q , subject to power and individual probabilistic interference 
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constraints. Stated in mathematical terms, player q's optimization problem is to determine, for given 

P-q ~ {(Pr,k) k=1 ) q ^ r=1 , a tu P le ( T 9' P<?> 1 q) SUch that 











subject to 


iV 






(a) 


Z^ P q,k {1q,k,r q ) -Wq^-p^k < I q , 
k=l 




(9) 


(b) 


P ' q %l q ,k,Tq) < Pq, k , and P^hq^Tq) < a q>k , Vfc = 1, • • 






(c) 


Pg G P g and r 9 min < r 9 < r g max . 







In Q we also included additional lower and upper bounds of r q satisfying < r™ m < r™ ax < T q 
and upper bounds on detection and missed detection probabilities < a Q:k < 1/2 and < /3 qk < 1/2, 
respectively. These bounds provide additional degrees of freedom to limit the probability of interference 
to the PUs as well as to maintain a certain level of opportunistic spectrum utilization from the SUs 
[1 — P qk > 1 — f3q t k\. Note that the constraints a q>k < 1/2 and (3 Qtk < 1/2 do not represent a real loss of 
generality, because practical CR systems are required to satisfy even stronger constraints on false alarm 
and detection probabilities; for instance, in the WRAN standard, a q ^ k = (3 q ^ k =0.1 |33| . 

3.2 Game with pricing 

We add now global interference constraints to the game theoretical formulation. In order to enforce 
coupling constraints while keeping the optimization as decentralized as possible, we propose the use of 
a pricing mechanism through a penalty in the payoff function of each player, so that the interference 
generated by all the SUs will depend on these prices. Prices are thus addition variables to be optimized 
(there is one common price associated to any of the global interference constraints) ; they must be chosen so 
that any solution of the game will satisfy the global interference constraints, which requires the introduction 
of additional constraints on the prices, in the form of price clearance conditions. Denoting by ir the price 
variable associated with the global interference constraint ([8]), we have the following formulation. 

Player q's optimization problem is to determine, for given p_ g and ir, a tuple (r q , p q , ~f q ) such that 

N 

maximize R q (r q , p, J q ) - tt ■ V] P^ilqfi, T q ) ■ w q>k • p q , k 
Tq ' Pq ' lq fc=i (10) 

subject to constraints (a), (b), (c) as in ([9]). 

The price obeys the following complementarity condition: 

N Q 

< TT 1 / maX - £ £ P ? T S (7 9 ,fc, Tq) ■ W q , k ■ Pq , k > 0. (11) 
k=l g=l 

In (fTTj) . the compact notation < a _L b > means a > 0, b > 0, and a ■ b = 0. The price clearance 
conditions (Hip state that global interference constraints ([8]) must be satisfied together with nonnegative 
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price; in addition, they imply that if the global interference constraint holds with strict inequality then the 
price should be zero (no penalty is needed). Thus, at any solution of the game, the optimal price is such 
that the interference constraint is satisfied. In the companion paper |32| . we show that a pricing-based 
game as (|10p -(fTTj) can be solved using best-response iterative algorithms, where the price ir is updated 
by the SUs themselves via consensus schemes; which require a limited signaling exchange only among 
neighboring nodes. In Sec. [7J we show that when the SUs are willing to collaborate in the form described 
above, they reach higher throughputs, which motivates the formulation (|10p -(fTTj). 

Note that when the price tt is set to zero, the game (fT0|) -( fTT|) reduces to the NEP in ([9]) where only the 
individual interference constraints ([7]) are imposed. Therefore, hereafter we focus only on (fT0|) -(fTT]). as a 
unified formulation including also the game © as special case (when ir = 0) . Note that when we deal with 
the general formulation ([10p -(fTT|). we make the blanket assumption that none of the power/interference 
constraints are redundant. 

3.3 Equivalent reformulation of the games and main notation 

Before starting the analysis of the proposed games, we rewrite the players' optimization problems (|10p 
with the side constraint (jlip in a more convenient and equivalent form. The principal advantage of the 
reformulation is that it converts the nonlinear constraints (b) and (c) into linear constraints; thus (i) 
facilitating the application of the VI approach to the analysis of the game, and (ii) shedding some light 
on the interpretation of the new concepts and results that will be introduced later on. 

At the heart of the aforementioned equivalence is a proper change of variables: instead of working with 
the original variables 7 g> fc's and T q 's, we introduce the new variables 7g,A;'s and T q s, defined as 

(lq,k, T q ) ^ (% t k, %) ■ % = y/ T q f q and % >k = T q f q — -^12. (12) 

VqMO 

Note that the above transformation is a one-to-one mapping for any 7 g ^ > and T q > 0; we can thus 
reformulate the games using the new variables without loss of generality. Moreover, a property of (112p 
is that the constraints on P qk { n fq,kt T q) an d -P™fc SS (7<?,fc> r <?) m each player's optimization problem [see (|9])] 
become linear in the new variables 7^ and T q : for each k = 1, . . . , N, we have 

P^k^Tq) < Pq,k, %,k > Q" 1 (Pq,k) 

( \~ (13) 

Pgk Uq,k,T q ) <a q>k , <y [L — a q< k) , 

where Q~ l (•) denotes the inverse of the Q-function [Q(x) is a strictly decreasing function on R]. Using the 
above transformation, we can equivalently rewrite the false-alarm rate Pq k { r iq,k: T q)i the missed detection 
probability P^k 8 {lq,ki T q)i an d the throughput R q (r q , p, y q ) of each player q in terms of the new variables 
l q - (%k)k=i s and ^?' s ' denoted by P q %{%,k,T q ), P™ k lss (%,k,r q ), and R q {r q , p, j q ), respectively; the 
explicit expression of these quantities is: 

Pq%(%,k, Tq) = = Q(%k) (14) 
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pmiss/ ^ \ Smiss/- ^ \ A ^ ( °g,fc ^<?> fc ( ^g.fc 1 Mg,fc|o) r g ^ n -s 

U,k VYq,k>T q ) = r k {J qi k,r q ) = y (15) 

V <?q,k\l / 

/ -2 \ AT 

E 9 (r g , p, 7 ,) = i?g(r 9 , p, %) 4 ( 1- j|r ) E I 1 ~ ^)) r ^ (p) • ( 16 ) 



q J- q 



k=l 



Using (|13p -(fT6|). the game (|10p -(fTT|) in the original players' variables [r q , p q , 7 ? ) 's [and thus also ([9])] can 
be equivalently rewritten in the new variables (r ? ,p ? ,7„)'s as : 



Players' optimization problems. The optimization problem of player q is: 

N 

maximize R q (? q , p, 7 „) - vr • V] P™ ss (7, fc , f ? ) • io 5 fc • p q , k 

r q , Pq ,f q ^ 

subject to 

N 

(a) E ^S" ? ^ ■ w ^ ■ ^ C* (17) 
k=l 

(b) <^ 7 9 , fc > ! ! ! < a qyk > Vk = l,...,N, 

I °"g,fe|l J 

(c) p q G and f 9 min < r q < r g max , 

Price equilibrium. The price obeys the following complementarity condition: 

TV Q 

< vr J_ / max -EE *<3" ■ w ^ ■ ^ ( 18 ) 

fc=l 9=1 



where S,, fc 4 Q~\\ - a q>k ), P q>k 4 Q' 1 ^), ?™ in 4 ^/ g r ? mi », and r g max 4 ^7™; and (a), (6), and 
(c) in (|17p are the image under the transformation (112p of (a), (b), and (c) in ([9]), respectively [see (j!2J) ]. 

Note that (even in the transformed domain), players' optimization problems in ()17p are still noncon- 
vex, with the nonconvexity occurring in the objective function and the interference constraints (a); the 
constraints (b) and (c) are instead linear and thus convex; the polyhedra in (b) may be however empty. 
To explore such a structure, as final step, we rewrite the game (|17p - (fT8|) in a more compact form, making 
explicit in the feasible set of each player the polyhedral (convex) part [constraints (b) and (c)] and the 
nonconvex part [constraint (a)]. Let us denote by X q the feasible set of player q in (fT7|) . given by 

X q 4 {(r q , Pq , %) e y q I I q (f q , p q , %) < 0} (19) 

where we have separated the convex part and the nonconvex part; the convex part is given by the poly- 
hedron y q corresponding to the constraints (6) and (c) 

a q ,k\o 1q,k — ( H q ,k\i — V q ,k\o ) T q 

W ~ "' \, (20) 

P, € V q , < Tg < f, maX j 

whereas the nonconvex part is given by the constraint (a) that we have rewritten introducing the local 
interference violation function 



[Tq, Pq, 7g) I 7<7,fc > P 9 ,fc, — < «<?,*:, VK = 1, . . . ,N 
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N 

I q (r q , Pq ,%)^Y, P ^ S (%,k,r q )-w q , k -p q , k - I™*, (21) 
k=l 

which measure the violation of the local interference constraint (a) at (r q , p q , 7 ). Similarly, it is conve- 
nient to introduce also the global interference violation function I(t, p, 7): 

N Q 

1(9, P, 7) = E E P £T ^ ■ ■ ~ /maX ' ( 22 ) 

k=l q=l 

which measures the violation of the shared constraint at (r, p, 7). 

Based on the above definitions, throughout the paper, we will use the following notation. The convex 
part of the joint strategy set is denoted by y = Yl q= x y q , whereas the set containing all the (convex part of) 
players' strategy sets except the q-th. one is denoted by y~ q — Wr-t^r^ similarly, we define X = JI^Li %q 
and X_ q = nr^g^- For notational simplicity, when it is needed, we will use interchangeably either 
(Tq, p q , 7„J or Xg = (r q , Pq, 7„) to denote the strategy tuple of player q; similarly, x = (r, p, 7) = 
(x 9 )^ =1 will denote the strategy profile (r, p, 7) of all the players, with r = (r q )® =1 , p = (p q )® = i, 
and 7 = {l q )q—i, whereas the strategy profile of all the players except the q-th one will be x_ f; = 
(r r , p r , 7 r )^=i rz A g - Using the above definitions, game (|17p -(fT8|) can be rewritten as 



Players' optimization. The optimization problem of player q is: 




maximize R q (x g , x_ g ) — tt ■ I(x q , x_ 9 ) 


(23) 




subject to x g = (r q , p, 7 ? ) G X q . 




Price equilibrium. The price obeys the following complementarity condition: 




< vr _L -I (x) > 0. 


(24) 



Given the equivalence between (|17p -(fl~8|) and (|23p -(p4|). in the following we will focus on the game in 
the form (|23l) - (|24| ) w.l.o.g.. For future convenience, Table [T] collects the main definitions and symbols used 
in d23Dd21D- 

4 Solution Analysis 

This section is devoted to the solution analysis of the game (|23|) - ff24| ) . It should be noted that except for 
the recent work |31| . no existing theory to date is applicable to analyze this game. We start our analysis 
by studying the feasibility of each optimization problem in (|23|) . We then introduce the definitions of NE, 
LNE, and the relaxed concept QNE, and establish their main properties. 

4.1 Feasibility conditions 

The feasibility of the players' problems (|23p requires that all the polyhedra y q defined in (|20p be nonempty; 
the following are necessary and sufficient conditions for this to hold. Introducing the SNR detection 
snr^ fc = ct| k / a qk experimented by the SU q over carrier k and using the definitions of o~ q ^i and o~ q ^ 
as given in Sec. 12. 1| the feasibility requirements are: 
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Table 1: Glossary of the main notation of game (|23p -([24|) 



Symbol 




Meaning 






sensing time of SU q 


Iq = (lq,k) 




detection thresholds of SU q over carriers k = 1, . . . , N 


Pq ~ (Pq,k)k=l 




power allocation vector of SU q 


IT 




scalar price variable 


% 




(transformed) sensing time of SU q [l)12p] 


Iq = (%k) 




(transtormed) detection thresholds oi bU g over carriers k = 1, . . . , iv [(|12J)J 






/, n i\ • 111 j • 1 1 • 1 * j POTT " 7 T P /I "1 1*" Y. 1 

(transformed) missed detection probability of bU q on carrier k [cf. (|15|)] 


x <? = { T qi P<Ji 7<j) 




strategy tuple of bU g (m the transformed variables) 


x — q — (jr, p r , 'Y r ) r ^q 


strategy profile of all the SUs (in the transformed variables) except the q-th. one 


X = (XgjJLi = (T, P , 


7) 


strategy profile of all the SUs (in the transformed variables) 


Pq ( x ij) x — q) 




opportunistic throughput of SU q in the transformed variables [cf. (fl"6|)] 


Iq( x q) 




local interference constraint violation of SU q [cf. (l2~H[ 


/(x) 




global interference constraint violation of SU q [cf. (|22[)] 


^ — llq=l <*q 




feasible set of SU q [cf. (|19p]. joint feasible strategy set 


X -q = Ylr^q X r 




joint strategy set of the SUs except the g-th one 


yq,y = uU y ^ 




convex part of X q [cf. (|20p]. Cartesian product of all 3V S 




Or 


1 (^) + |Q- 1 fe)lK fe|1 K fc | ) ) Vfc = lj ___ jJV) V , = 1,...,Q. (25) 



The conditions above have an interesting physical interpretation that sheds light on the relationship 
between the achievable sensing performance and sensing/system parameters. It quantifies the trade-off 
between the sensing time (the product "time-bandwidth" / g r™ ax of the system) and detection accuracy: 
the smaller the required false alarm and missed detection probabilities, the larger the number of samples 
and thus r q to be taken. 



4.2 NE and local NE 

The definitions of NE and local NE of a game with price equilibrium conditions as (|23|) -([24p are the 
natural generalization of the same concepts introduced for classical noncooperative games having no side 
constraints (see, e.g., \27 \ l29" l [39]) and are given next. 

Definition 2. A Nash equilibrium of the game A23\) with side constraints \21$ is a strategy-price tuple 
(x*, 7T*), such that 

x* G argmax{ J R g (x g ,x* (? )-7r*-7(x g ,x%)}, Vg = 1, • • • , Q, (26) 

an d , ^ / \ 

< it* _L - J(x*) > 0. (27) 

A local Nash equilibrium is a tuple (x*,7r*) for which an open neighborhood M q o/x* exists such that 

x* G argmax |i? 9 (x 9 ,x* ) -vr* -/(xg,x* )) , Vq = l,--,Q, (28) 
x q ex q nN* ^ > 
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0.4 0.6 
Power allocation 

Figure 1: Example of nonconvex feasible set of a SU for fixed sensing time and N = 1. 
and (E?Jj holds. A NE (LNE) is said to be trivial ifp* = for all q = 1, . . . , Q. 

Note that a NE is a LNE, but the converse is not necessarily true. The feasibility of the optimization 
problems in (|23p [cf. (|25|) ] is not enough to guarantee the existence of a (L)NE. The classical case where 
a NE exists is when the players' objective functions (to be maximized) are (quasi-)concave in their own 
variables given the other players' strategies, and the players' constraint sets are compact and convex and 
independent of their rivals' strategies (see, e.g., |27[ 140] ); other cases are when the game has a special 
structure, like potential or supermodular games. The game (I23p with side constraints (I24p has none of 
such properties; indeed, each player's optimization problem in (f23|) is nonconvex with the nonconvexity 
occurring in the objective function and the local/global interference constraints; moreover, the set of 
feasible prices [satisfying (f24"|) ] is not explicitly bounded [note that these prices cannot be normalized due 
to the lack of homogeneity in the players' optimization problem (I23p ]. Figure Q] shows an example of 
feasible set of the detection threshold and power allocation of a SU, for a given sensing time and N = 1; 
the set is nonconvex, implying the nonconvexity of the SUs' optimization problems in (j23|) . 

The existence of a (L)NE under the nonconvexity of the players' optimization problems and the un- 
boundedness of the price variables is in jeopardy; in fact, the game may not even have a NE, or the NE 
may exist under some abstract technical conditions (that are not easy to be checked^]) requiring a partic- 
ular profile of the system parameters (e.g., channel conditions, interference level, SNR detection, etc.). 
This would make the system behavior (e.g., equilibrium existence versus nonexistence, convergence of algo- 
rithms versus nonconvergence) unpredictable, which is not a desirable feature in practical applications. To 
overcome this issue, in this paper, we propose the use of a (relaxed) equilibrium concept— the QNE— which 
will be show to alway exist (even when the NE fails). The QNE along with its main properties and 

optimality interpretation is introduced next. 

2 Conditions for the existence of a NE are established in [261 Th. 1] for NEPs (with no pricing), which may have 
discontinuous and/or nonquasi-concave payoff functions, but satisfying the so-called "diagonal transfer quasi-concavity" and 
"diagonal transfer continuity"; the case of noncompact strategy sets have been addressed in [411 Th. 5] (see also references 
therein). Despite their theoretical importance, the verification of these conditions for payoff functions and sets arising from 
realistic applications such those in this paper does not appear possible. 
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4.3 Quasi-Nash equilibrium 

For a nonlinear program constrained by finitely many algebraic equations and inequalities and a differen- 
tiable objective function, stationarity is defined by the well-known Karush-Kuhn- Tucker (KKT) conditions 
that are necessarily satisfied by a locally optimal solution under an appropriate Constraint Qualification 
(CQ), see |42l Prop. 1.3.4]; solutions of the KKT system are called stationary solutions of the associated 
optimization problem. In the context of nonconvex problems, the common approach well-accepted in 
practice, is then to look for a stationary (possibly locally optimal) solution. Here we apply this approach 
to nonconvex games and introduce the concept of QNE of the nonconvex game (|23p -([24|). defined as a 
stationary solution of the game, which is a tuple that satisfies the KKT conditions of all the players' op- 
timization problems (123|) along with the price equilibrium constraints (124j) . The prefix "quasi" is intended 
to signify that a NE (if it exists) must be a QNE under mild CQsjf) A direct study of the main properties 
of the QNE (e.g., existence and uniqueness) based on the KKT conditions of the game is not an easy 
task; thus to simplify the analysis, we rewrite first the aforementioned KKT conditions as a proper VI 
problem |42] ; then, building on VI tools, we provide a satisfactory characterization of the QNE. It should 
be noted that unlike a single optimization problem where a stationary solution must exist (under a CQ) 
if an optimal solution exists, this is not the case for a game because a QNE is the result of concatenating 
the KKT conditions of several optimization problems. 

At the basis of our approach there is an equivalent and nontrivial reformulation of the necessary con- 
ditions for a tuple (x*, 7r*) to be a (L)NE of (I23P - ((24| ). which explores the structure of the players' feasible 
sets X q , as described next. The classical approach to write the KKT conditions of each player's opti- 
mization problem would be introducing multipliers associated with all the constraints in the set X q — both 
the convex part y q and the nonconvex part I q (x q ) < [cf. (|19p ]— and then maximizing the resulting 
Lagrangian function over the whole space (i.e., considering an unconstrained optimization problem for 
the Lagrangian maximization). The approach we follow here is different: instead of explicitly accounting 
all the multipliers as variables of the KKT system, for each player's optimization problem, we introduce 
multipliers only for the nonconvex constraints {I q (x q ) < 0, q = 1, . . . , Q} [i.e., (a)], and retain the convex 
part y q as explicit constraints in the maximization of the resulting Lagrangian function. The mathemat- 
ical formulation of this idea is given next. Denoting by X q the multiplier associated with the nonconvex 
constraint I q (x q ) < of player q, the Lagrangian function of player q is: 

£ 9 (x, A g ,7r) = R q (x q ,x_ q ) - n • i"(x 9) x_ g ) - X q ■ I q (x q ), (29) 

which depends also on the strategies x_ f/ of the other players and the price tt. Building on Definition 
[2] it is not difficult to see that if (x*,7r*) is a (L)NE of (|23]) - (|24l) and some CQ holds at x*, there exist 
multipliers A* = (X*) q=1 associated with the local nonconvex constraints {/^(xq) < 0, q = 1, . . . , Q} such 

3 Note that without some CQs being satisfied, the KKT conditions may not be even valid necessary conditions of optimality 
for the optimization problems (|23p . making the QNE meaningless. 
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that the tuple (x*,7r*, A*) satisfies 



(i) : x* 6 argmax{£g(xg,x* g , A*,7r*)} , Vg = 1, ■ ■ ■ , Q, 



(ii) : 

(iii) : 



< vr* _L - J(x*) > 
< A* _L - I q (Xq) > 0, Vg = l,...,Q. 



(30) 



Note that each Lagrangian maximization in (i) is constrained over the convex part y q of the player's local 
constraints X q . Since y q is a convex set, we can invoke the variational principle for the optimality of x* 
in (i), and obtain the following necessary conditions for (f30~|) to hold: 

(i'): (x g -x*) T (-V x ,£ 9 (x ? ,x* g ,A*,7r*)) > Vx g E y q and Vg = 1, • • • , Q, 

(ii'): (tt - tt*) • (- I(x*)) > 0, Vtt e R+, (31) 

(iii') : (X q - A*) • (- / g (x*)) > 0, VA g E R+ and Vg = 1, . . . , Q, 

where (i ) is just the aforementioned first-order optimality condition of the (nonconvex) optimization 
problem in (i); and (ii )-(iii ) are equivalent to (ii)-(iii). Finally since the variables in (i ) — (iii ) are not 
coupled each other by any joint constraint, we can equivalently rewrite the three separated inequalities 
(i ) — (iii ) as one inequality, obtained just summing (i ) — (iii ). More specifically, ([31"j) is equivalent to 

T 



( x-x* \ 

TT — TT 

V A — A* / 



/-(V x A(x*,^A*,))? =1 \ 



-J(x*) 



Q 



>0, V(x,vr,A) E JJ%xM + 

9=1 



(32) 



= *(X*,7T*,A*) 



The above system of inequalities defines the so-called VI problem in the variables (x, 7r,A), whose 
vector function is \1/ (x, tt, A) and feasible set is S, both defined in (|32p Pl such a VI is denoted by VI(5, \l/). 
It follows from the implications (j30[) =?> (|32f) that the VI (5, \&) is an equivalent reformulation of the KKT 
conditions of the nonconvex game (|23p -([2l|). wherein the convex constraints y q 's (and thus the associated 
multipliers) have been absorbed in the VI set S. The VI(5, S&) is indeed composed of three sets of variables 
only: i) the players' decision variables x = (r, p, 7); ii) the multipliers A = (\ q )® =1 associated with the 
local nonconvex constraints {I q (x q ) < 0, q = 1,...,Q}; and iii) the price tt; there are no multipliers 
associated with the constraints y q s. The above discussion is made formal in the following lemma. 
Lemma 3. The KKT conditions of the game A23\)-(24\) are equivalent to the VI (S, The equivalence 
is in the following sense: 

- Suppose that (x*,7r*, A*, fx*) is a KKT solution of the game, with A* = (A*)2 =1 and fi* = (/x*)^ =1 being 
the multipliers associated with the players' nonconvex constraints {I q (x q ) < 0, q = 1,...,Q} and 

4 The VT(<S, *) problem is to find a point z* G S, the solution of the VI, such that (z - z*) T *(z*) > for all z e S [42]. 
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convex constraints y q 's, respectively. Then, (x*,7r*, A*) is a solution of the VI (S, 
- Conversely, suppose that (x, 7f, A) is a solution of the VI (S, Then, there exists JZ = (jl q ) q=1 such 
that (x, ?f, A, /l) is a KKT solution of the game, with JZ = (jZ q )® =l being the multipliers associated 
with the players ' convex constraints y q 's. 

Given the connection between a VI problem and the KKT system of the game, the QNE of the game 
can also be interpreted as solutions of the VI (S, \&), which motivates the following formal definition of 
QNE of the nonconvex game (f23>([24)) . 

Definition 4. The quasi-Nash equilibria (QNE) of the game ^23\) with side constraints (Efy are the 
solutions (x*, 7r* , A*) of the VI(S, A QNE is said to be trivial if pt = for all q = 1, . . . , Q. 

Our concept of QNE is conceptually similar but formally different from other forms of local equilibria 
introduced in the literature, such as the critical NE |26| or the generalized equilibrium |30| . The former is 
indeed a feasible strategy profile of the players at which the gradients of each player's objective function 
(taken with respect to that player's strategy) are vanishing; the vanishing property of the gradient is 
typically not satisfied by solutions of constrained optimization problems, let alone equilibria of inequality 
constraint games. The latter equilibrium concept |30] is defined as the solution of an abstract set value 
inclusion, whose fruitful application to realistic games like those proposed in this paper is seriously ques- 
tionable. In contrast, our Vl-based definition of local equilibrium opens the way to the use of the VI 
machinery |42| . both theory and methods, to successfully study the QNE of realistic games. The line of 
analysis we are going to illustrate, based on |31] , is in fact a new contribution in the literature of solution 
analysis of local equilibria. 

Optimality interpretation of the QNE. Since the QNE is a stationary solution of the game, it is 
desirable to understand whether the QNE has some (local) optimality properties. Interestingly, the QNE 
has the following equivalent interpretation: the triplet (x*,7r*, A*) is a QNE of the game (I23l) -( |24"j) if and 
only if it is an optimal solution of the players' optimization problems in the following sense [recall that 
x* = K)?=i with x* = (t* p*, %), and x*_ q = (7?, p*, 7r) r ^] : 

(a) (NE power allocation for fixed optimal thresholds and sensing times) The tuple p* is a NE of the SUs' 

game when f = t* and 7 = 7*; for each q = 1, • • • , Q, 

p* G argmax <R q (r£, p q , %x*_ q ) - vr* • I(t%, p q , %,x*_ q )\ 
Pi 1 J 

subject to (7* p q , 7*) G X q , ( 33 ) 

(b) ( Threshold optimization for fixed optimal power allocations and sensing times) The threshold vector 

7* is an optimal solution of the q-th. players' optimization problem when r = t* and p = p*: for 
each q = 1, ■ ■ ■ , Q, 

% £ argmax {R q (r q , p* x* q ) - vr* • /(?*, p*, x* J 



subject to (tJ, p* 7,) G X q ; 
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(c) (Sensing time optimization for fixed optimal power allocations and thresholds) The sensing time is 

an optimal solution of the q-th players' optimization problem when 7 9 = 7* and p = p*: for each 
q = !,■■■ ,Q, 

T* q G argmax { R q (r q , p*, 7* x* g ) - vr* • I(r q , p*, 7* x* q ) \ 

subject to (r„ p*, %) G ( 35 ) 

(d) (Price equilibrium) The complementarity condition < 7r* _L — /(x*) > holds; 

(e) (Common multipliers for individual interference constraints) For each q = 1, • • • , Q, there exists a 

common optimal multiplier tuple A* associated with the individual interference constraints {I q (x q ) < 
0, q = 1, . . . , Q} at x*. 

In words, at a QNE (x*,7r*, A*) we have that: i) each user unilaterally maximizes his own function with 
respect to each of his own strategies t£, p*, and 7* separately, while keeping the rivals' strategies fixed at 
the optimal value [statements (a)-(c) above]; ii) the optimal price value ir* satisfies the complementarity 
condition (|24p [statement (d) above]; and iii) A* is the common optimal multiplier for all the players 
associated with the individual interference constraints in (|33p ~([35|) [statement (e) above]. Note that (|33[) 
is a linearly constrained concave maximization problem; thus multipliers exist for this problem. Problems 
(|34p and (|35p are concave maximization program with convex constraints fulfilling the Slater CQ for a 
fixed but arbitrary nonzero p^. Thus they also have constraint multipliers. Moreover, the constraint 
sets of problems (|33p . (|34p . and (|35p are bounded. A key requirement in the QNE definition is therefore 
condition (e) that stipulates the existence of a set of common multiplier tuple A* for the individual 
interference constraints, which is not automatically guaranteed. Sec. 14.51 focuses on this issue. 

4.4 Connection between LNE and QNE 

First of all, note that a (L)NE (x*, ir*) is composed of the sensing/transmission strategies of the players as 
well as the price, whereas a QNE is a tuple (x*, ir* , A*) consisting also of the KKT multipliers A* of the 
game's nonconvex constraints {I q (x. q ) < 0, q = 1,. . . ,Q}- To explore the connection between a LNE and 
a QNE we need to show that, under the feasibility and solvability of all the players' optimization problems 
in (|23p (cf. Sec. I4.ip . the KKT conditions of the game are valid necessary conditions of optimality for 
these problems. To this end, we need to verify that an appropriate CQ holds. In this paper, we will use 
the Abadie's CQ (ACQ) |42[ Sec. 3.2]. Not explicitly mentioned, this CQ is essentially the key to show 
the validity of the following result, which states that every LNE must be a QNE in this game. 

Proposition 5. Given the game i23\) with (possibly) side constraints \2J$ , the following holds: z/(x*,7T*) 
is a LNE of the game then A* exists such that (x*,tt*, A*) is a QNE. 

Note that the converse of Proposition [5] in general is not true; for a QNE to be a LNE, one needs 
appropriate second-order sufficiency conditions. The analysis is quite involved and goes beyond the scope 
of this paper; we refer the interested reader to |31[ Proposition 7] for details. In the companion paper [32], 
we derive sufficient conditions for (a special case of) the game to have a unique QNE, which then must 
coincide with a (L)NE. Finally, recall that, since the set S of the VI(5, \l/) is unbounded, the existence of 
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a QNE (a solution of the VI) is not automatically guaranteed. The study of existence and boundedness 
of the QNE is addressed in the next section. 

4.5 Existence of a QNE 

We can now study the existence of a QNE of the game (|23p -([24|). via the solution analysis of the VI (<S, 
The theorem below states that the game (|17p . if feasible, always admits a non trivial QNE. 

Theorem 6. Suppose that the set S in \32\) is nonempty [i.e., the feasibility conditions i25\) hold true]. 
Then, the VI(S, has a nonempty bounded solution set; thus the game t23\) with side constraints \2J$ 
has bounded QNEs. Moreover, every QNE (and thus LNE) is non trivial. 

Proof. See Appendix. □ 

Note that for the game in ([9]), where there are no global interference constraints (i.e., tt = 0), it is 
not difficult to show that every QNE (and thus LNE) is such that p* ^ for all q = 1, . . . , Q. In the 
presence of side constraints, instead, this is not obvious, because of the potential unboundedness of the 
prices tt (if the interference constraints are "too stringent", implying large prices, the users may not be 
allowed to transmit at all]. Interestingly, sufficient conditions for every QNE (LNE) of the game with side 
constraints (|24p to have p* 7^ for some/all q = 1,...,Q can be derived (we omit the details because 
of the space limitation). These conditions have a physical interpretations: they impose an upper bound 
on the (normalized) secondary-to-primary cross-channels \G„(k)\ 2 , so that a bounded set of equilibrium 
prices exists such that the interference constraints at the primary receiver can be satisfied with nonzero 
transmissions of the SUs. 

5 The Equi-sensing Case 

The decision model proposed so far is based on the assumption that only the PUs' signals are involved 
in the detection process performed by the SUs, implying that the SUs are somehow able to distinguish 
between primary and secondary signaling. This can be naturally accomplished if there is a common sensing 
time (still to optimize) during which all the SUs stay silent while sensing the spectrum. However, the joint 
optimization of the sensing and transmission parameters proposed in Sec. [3] in general, leads to different 
optimal sensing times of the SUs, implying that some SU may start transmitting while some others are still 
in the sensing phase. Since the energy based detection as proposed in Sec. 12. H is not able to discriminate 
between sources of received energy, this in-band interference generated by the transmitting SUs would 
confuse the energy detector and thus introduce a significant performance degradation. To overcome this 
issue two different directions can be explored, as detailed next. 

A first approach could be using more sophisticated signal processing techniques for the SUs' sensing 
that look into a primary signal footprint (e.g., modulation type, data rate, pulse shaping, or other signal 
feature) to improve the detector robustness, at the cost of increased complexity. This would allow the 
SUs to differentiate between primary signals, background noise, and interference and thus possibly benefit 
from adaptive signal processing for canceling SU interferers. Depending on what a priori knowledge of the 
primary signal is known to the SUs, different feature detectors can be applied under different scenarios and 
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complexity requirements |43j . For example, ATSC digital TV signal has narrow pilot for audio and video 
carriers; CDMA systems have dedicated spreading codes for pilot and synchronization channels; OFDMA 
packets have preambles for packet acquisition. Under such a priori information, the optimal detection 
is given by the matched filter \36\ fl3] . We leave the reader the easy task to extend the proposed game 
theoretical formulation to the case in which the energy detector is replaced by the pilot-based matched filter 
(note that, under mild conditions, the performance of the matched filter are still given by Q, but with a 
different expression for the fJ>q,k\i s an d cr q: k\i s [43j ) . Note, however, that the better sensing performance 
of the matched filter are obtained at the cost of additional hardware complexity: the SUs would need a 
dedicated receiver for every PU class. 

The second approach we propose is suitable for scenarios where feature detection is not implement able, 
and thus the energy detector is the only available option (see also Sec. [6] for a more general energy detector- 
based scheme). In such a case, the only way for the SUs to distinguish the primary from the secondary 
signals is to avoid overlapping secondary transmissions during the sensing phase. This can be done by 
"forcing" the same sensing time for all the SUs, which still needs to be optimized. To do that while keeping 
the distributed nature of the optimization, we propose to modify the original games as follows. 



Players' optimization. The optimization problem of player q is: given c > 

maximize R q (x q , x_ 9 ) - vr • I(x) - \ - I g (3g) 

subject to x g = (? q , p q , J q ) G X q . 
Price equilibrium. The price obeys the complementarity condition 



The difference with respect to the previous formulations is that now in the objective function of each 
player there is an additional term that works like a penalization in using different sensing times for the 
players. Because of this penalization, one would expect that, for sufficiently large c, the equilibrium of the 
game tends to have equal (normalized) sensing times Tq/^/Tg's (and thus equal r g 's, since f q j \J~fq = ^q- 
see (j!2p ], provided that such a common value is feasible for all the players' optimization problems; this 
solution indeed is the one that minimizes the loss induced by the penalization in the payoff function of 
each player. This intuition is formalized in Theorem [7] below. 

Feasibility conditions. The first step is to derive (sufficient) conditions guaranteeing the existence of a 
common value for the sensing times T q s. We have the following: For every 7 = (^ q )Z = i for which there 
exist a p = (p 9 )^ =1 and f = (r q ) q=1 satisfying the feasibility conditions (b) and (c) of the original game 
in (fTTl) . there must exist a common r such that, for all q = 1,...,Q and k = 1, . . . N, we have 



imii -max 



^<V^<^, and ^^-(^H-^lo)^ < Sgfc . (37) 

\/fq V fq (T q,k\ 1 

The first set of conditions in (I37p simply postulates the existence of an overlap among the (normalized) 
sensing time intervals [T^^/y^Tg, T™ ax /y^], which is necessary to guarantee the existence of a common 
value for the sensing times in the original variables T q s. The second set of conditions guarantee the 
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existence of a common r for the values of feasible 7 that are candidates to be QNE of the original game 
(|17p . Note that this is less than requiring r to satisfy also the interference constraints (a) in (fTT|) for all 
feasible 7 and p. 



Theorem 7. Given the game in L'J6}) with side constraints (Ej), suppose that the feasibility conditions in 



\31\) are satisfied. Let {c v }'^L 1 be any sequence of positive scalars such that \\m. u ^ +00 c u = +oo ; and let 
(x^7r^, \ v ) be a QNE of the game for c = c u . Then, the sequence {(x^ tt u , X u )} ( ^ =1 has a limit point, 
denoted by (x°°, 7r°°, A°°), where x°° = (t°°, p°°, 7°°); for every such a limit point, the following hold: 

(i) There exists a feasible t* such that 

q ~ r -V^, Vr,g = l,...,Q, andr^q; (38) 



(ii) t* in A38\) has the following optimality properties: 



= argmax jfj R g (y/J^, p°°, 7°°) - tt 00 • I ((V^)^ > P°°. 7°°) | 



subject to ( VXr, p°°, tT) e ^9' Vg = 1, . . . , Q. 



(39) 



Note that (|38p states that in the limit a// the original sensing time variables r q must be equal to r* 
[recall that f q j ' yfjq = ■\f 7 Tqi see (1 1 2[) ] . implying that there always exists a sufficiently large c such that one 
can reach a QNE of game (|36p having sensing times that differ from their average by the desired accuracy. 
Moreover, such a common sensing time r* is optimal in the sense given by ([39]): r* is the unique maximizer 
of the sum (priced) throughput of the original game (jlOh . satisfying the price and interference constraints, 
while keeping the players' powers, thresholds, and price fixed to p°°, 7^°, and 7r°°, respectively. In the 
companion paper [32], we focus on distributed algorithms to compute such limit points. 

6 Extension of the Framework 

The framework presented in this paper may be extended to cover more general settings, without affecting 
the validity of the obtained results. In this section, we briefly discuss some of these extensions. 

Composite hypothesis testing and robust detection. The sensing model introduced in Sec. 12.11 
can be generalized to the case of multiple active PUs, and the presence of device-level uncertainties (e.g., 
uncertainty in the power spectral density of the PUs' signals and thermal noise) as well as system level 
uncertainties (e.g., lack of knowledge of the number of active PUs). In this more general setting, there are 
2 configurations of possibly active PUs associated to the elements of the power 

we then propose to formulate the spectrum sensing problem of the SUs as a composite hypothesis testing, 
based on the following two sets of hypotheses: for each SU q = 1, . . . , Q, at carrier k = 1, . . . , N and time 
index n = 1, 2, . . . , K q , 

5 The power set of a given set S is the set of all subsets of S, including the empty set and S itself. 
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(PU signal absent) 7{ 0) fe : Vq,k[ n \ = w gA n ] (40) 
(PU signal present) U ljk : y q> k[n] = I q , k \K.[ n \ + %,fcM> for some £ G P(P)\{0} 

where %o,fc represents the absence of any primary signal over the subcarrier k, and T~Li t k represents the 
presence of (at least) one PU, with -^^[n] being the signal received by SU q over carrier k due to the 
presence of the active PUs, indexed by the elements of K,. 

Under the assumptions that i) the noise variance over each subcarrier is known at the receiver of each 
SU within a given uncertainty interval; and ii) the SUs do not know the current number of active PUs 
out of P PUs (the set /C), we proved that the decision rule (|2|), with y q ,k[n\ given in (|40p . is a Uniformly 
Most Powerful (UMP) test [35] , Roughly speaking, this means that ([2]) is optimal for both the false alarm 
and detection probabilities, in the sense that it reaches the desired false alarm rate (size of the test) while 
maximizing the detection probability over the entire range of noise uncertainty and uncertainty on the set 
of the active PUs. In [35] . we showed that the (worst-case) performance of the proposed test are formally 
still given by ([3]), but with a different expression for the constants fJ, q ^\i s and cr^^s. Thus, the analysis 
developed in the previous sections apply also to this more general model; because of the space limitation, 
we refer to |35| for details. 

Game-theoretical formulations in the presence of multiple PUs. To simplify the presentation, 
we have thus far considered scenarios where there is only one active PU and a single global interference 
constraint in the form of ([8]). The proposed game-theoretical formulations can be readily extended to the 
case of multiple active PUs and additional local/global interference constraints. For instance, suppose that 
there are (at most) P active PUs along with their associated local and/or global interference constraints, 
e.g., given respectively by (we write the constraints directly in the transformed variables j q< k and T q ): for 
each q = 1, . . . , Q, 

N 

ii p H? q ,p q ,%)±^p$ ss (% k ,%)-w$-p q> k - ir xip) <o, p=i,...,p, (41) 

k=l 

and 

Q N 

I {p) (t, P, 7) = E E P T ft**'**) ■ W S ■ P q ,k - r 11 ^ < 0, V = 1, • • • , P, (42) 

q=l k=l 

where the maximum average interference allowed to be generated by the SU q [or 

all the SU's] that is tolerable at the primary receiver p; w^^s are a given set of positive weights (possibly 
different for each PU p); and -P™/! ss (7q,fc, T q ) is the missed detection probability in the transformed variables, 
defined in (|15p . Average per- carrier local/global interference constraints can also be introduced |35| . 

To cast the system design in the general formulation (|23p -([24|). we can proceed as in Sec. 13.21 Similarly 
to (I19p and (|22p . we introduce the feasible set of local constraints of each user q, denoted now by X q , 

X q 4 |(r ? , Pq , %) G y q | I g (P\T q , Pq , %) < 0, Vp = 1, . . . , P} , 

with the convex part y q defined as in (|20p ; and the interference violation (column) vector function 
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I(r, p, 7) grouping all the global interference constraints (|42p . and defined as 

I(r,p,7)4(/(p)(r, p, y)Y . (43) 
\ / p=i 

Instead of having a single price variable, in the presence of multiple global interference constraints, we 
associate a different price tt^ to each global interference constraint I^ p \t, p, 7). Thus, we will have a 
vector of prices— the (column) vector 7r = (7r p )^ =1 — to be optimized along with multiple price clearance 
conditions (one for each pair price/interference constraint). Using the above notation, the resulting game 
theoretical formulation is the natural generalization of (|23p -([24"|) and is given next. 

Players' optimization problems. The optimization problem of player q is: 

maximize R q (r q , p, 7 f/ ) - n T I(r, p, 7) 

r q ,p q ,l q ^ 

subject to (f g , p q , j g ) e X q . 
Price equilibrium. The price obeys the following complementarity conditions: 

< 7T _L -I(t, p, 7) > 0. (45) 

The analysis of this more general game can be addressed following the same procedure as introduced 
in Sec. 3] for the game (|23p -([24"|). and thus is omitted. 



7 Numerical Results 

In this section, we provide some numerical results to illustrate our theoretical findings. More specifically, we 
compare the performance achievable at the QNE of the proposed game with those achievable by the state- 
of-the-art decentralized |17| (special cases are those in 1 12 ^ I13 [ [T4"]) and centralized |18] schemes proposed 
in the literature for similar problems; such schemes do not perform any sensing optimization using thus 
all the frame length for the transmission, and the QoS of the PUs is preserved by imposing (deterministic) 
interference constraints (we properly modified the algorithms in |18| to include the interference constraints 
in the feasible set of the optimization problem). Interestingly, the proposed design of CR systems based on 
the joint optimization of the sensing and transmission strategies is shown to outperform both centralized 
and decentralized current CR designs, which validates our new formulations. We then show an example of 
the optimal sensing/throughput trade-off achievable at the QNE. Finally, we compare the sum-throughput 
achievable by the SUs in the presence of local and global interference constraints [game formulation ([9]) 
vs. (|36|) ] . which sheds light on the achievable trade-off between signaling and performance. 

All the numerical results here are obtained using the algorithms described in the companion paper 
[32 j. Note that some of the proposed algorithms require some signaling among the users in the form of 
consensus schemes; when this happens, some {finite) time of the available (sensing-transmission) frame 
T, let us say T cons , needs to be allocated for this purpose. The interesting result is that our consensus 
implementation is proved to converge in a finite number of iterations (T cons is then bounded) . In order to 
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make our numerical comparisons fair, we included this time loss in the throughput functions, which are 
still given by ([5]), but 1 — T q /T is replaced by 1 — (r q + T cons )/T, with T cons < T and r q <T — T cons . 

Example ^1: How good is a QNE? In Figure [2] we compare the performance achievable at a QNE 
of the game (|36p with those achievable at the stationary solutions of the sum-rate maximization problem 
subject to interference constraints |18| [subplot (a)] and at the NE of the game in |17| [subplot (b)]. We 
consider a hierarchical CR network where there are two PUs (the base stations of two cells) and fifteen 
SUs, randomly distributed in the cells. The (cross-)channels among the secondary links and between the 
primary and the secondary links are simulated as FIR filter of order L = 8, where each tap has variance 
equal to l/L 2 ; the available bandwidth is divided in N = 1024 subchannels. For the sake of simplicity we 
consider only individual interference constraints (J7J), assuming the same interference limit J max for all SUs. 
Moreover we impose for each player the same false-alarm rate over all the subcarriers (possibly different 
among the players). To quantify the throughput gain achievable at the QNE of the proposed game, in 
Figure [21 we plot the (%) ratio (SRqe — SR)/SR versus the (normalized) interference constraint bound 
P/I max (P 9 = p r = p for all q ^ r), for different values of the SNR detection snr^ = o~j J°\^ where 
SRqe is the sum-throughput achievable at the QNE of the proposed game (with c = 100) whereas SR is 
either the sum-rate at a stationary solution of the social sum-rate maximization |18| subject to interference 
constraints [subplot (a)] or the sum-rate at the NE of the game in |17| [subplot (b)]. From the picture, 
we clearly see that the proposed joint optimization of the sensing and transmission parameters yields a 
considerable performance improvement over the current state-of-the-art CR centralized and decentralized 
designs, especially when the interference constraints are stringent. These results support the proposed 
novel formulation and concept of QNE. 

Example ^2: Sensing/throughput trade-off. Figure [3] shows an example of the expected trade-off 
between the sensing time and the achievable throughput. More specifically, in the picture, we plot the 
(normalized) sum-throughput achieved at a QNE by a player of the game versus the (normalized) common 
sensing time, for different values of the (normalized) total interference constraint (the setup is the same 
as in Figure [2]). According to the picture, there exists an optimal duration for the (common) sensing time 
at which the throughput of each SU is maximized. Moreover, as expected, more stringent interference 
constraints impose lower missed detection probabilities as well as false-alarm rates; requirement that is 
met by increasing the sensing time (i.e., making the detection more accurate). This is clear in the picture 
where one can see that the optimal sensing time duration increases as the interference constraints increase. 
In the same figure we also plot the sum-throughput achieved at the QNE of the game (I36p setting c = 100 
(square markers in the plot). Interestingly, the proposed approach based on a penalty function leads 
to performance comparable with those achievable by a centralized approach that computes the optimal 
common sensing time obtained by a discrete search of such a time. 

Example ^3: Global vs. local interference constraints. Global interference constraints impose less 
stringent conditions on the transmit power of the SUs than those imposed by the individual interference 
constraints, implying better throughput performance of the SUs (at the price however of more signaling 
among the SUs, as quantified in our companion paper |32j). Figure [4] confirms this intuition; in the 
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(a): Comparison with Centralized Approaches 




Figure 2: Performance achievable at the QNE of (]36[) : sum-rate gain (%) versus the normalized interference, for 
different values of the detection SNR and c = 100; comparison with centralized solutions [18] [subplot (a)] and 
game theoretical solutions [T7] [subplot (b)]. 

subfigure(a) we plot the average (normalized) sum-throughput of the SUs achievable in the game ([9]) [only 
local interference constraints] and (I36p [global interference constraints] as a function of the maximum 
tolerable interference at the primary receivers, within the same setup of Figure 1 (the curves are averaged 
over 300 random i.i.d. Gaussian channel realizations). In ([9]), the interference thresholds I™ ax are set 
jmax _ jmax^Q £ Qr & vj g an( j Jjq^^ pu S) so that all the SUs generate at the primary receivers the same 
average interference level and the aggregate average interference satisfies the imposed interference threshold 
jmax r n Figure H^b), we show an example of the average (normalized) interference profile measured at one 
of the (two) primary receivers, obtained solving the game ([9]) and (|36p . We clearly see from both pictures 
that, as expected, global interference constraints are less conservative than the local ones, yielding thus 
better performance of the SUs; this however comes at the price of some signaling among the SUs. 
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Figure 3: Example of optimal sensing/throughput trade-off: Normalized throughput versus the normalized sensing 
time, for different values of the (normalized) interference threshold. The square markers correspond to the QNE of 
the game (|36[) . achieved with c = 100. 
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Figure 4: Global versus individual interference constraints [game © versus game Subplot (a): average 

(normalized) sum-throughput as a function of the maximum tolerable interference / max at the primary receivers. 
Subplot (b): Average per-carrier probabilistic (normalized) interference generated by the solution of the games at 
one of the PU's receiver. 



8 Conclusions 

In this paper, we proposed a novel class of Nash problems wherein each SU aims to maximize his own 
opportunistic throughput by choosing jointly the sensing duration, the detection thresholds, and the vector 
power allocation over a multichannel link, under several interference constraints, either local or global. 
In particular, to enforce global interference constraints while keeping the optimization as decentralized 
as possible, we proposed a pricing mechanism that penalizes the SUs in violating the global interference 
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constraints. The resulting games belong to the class of nonconvex games with unbounded pricing variables, 
whose analysis cannot be addressed using mathematical tools from the existing game theory literature. 
To deal with these issues, we proposed the use of a relaxed equilibrium concept— the QNE— and studied its 
properties and connection with LNE. In particular, we proved that the proposed games always have a QNE, 
even when a (L)NE may not exist. We then validated the QNE both theoretically and numerically: i) we 
proved that the QNE has some local optimality properties; and ii) numerical results show the superiority 
of the proposed design with respect to the state-of-the-art centralized and decentralized resource allocation 
algorithms for CR systems. Distributed solution schemes for computing such equilibria along with their 
convergence properties are proposed and studied in the companion paper [32J. 



Appendix: Proof of Theorem [6] 

Because of the space limitation, we provide only a sketch of the proof. We need to show that the VI(5, \&) in 
(|32p has a solution. According to |42[ Proposition 2.2.3], it is sufficient to find a tuple (x ref , 7r ref , A ref ) G S, 



with x ref = (r g ref , p 9 ref , 7f f )J =lJ such that the set 



£< 4 { ( x , vr, A) G S 



/x- 


x ref 


7T — 


7T ref 




l A- 


A ref 



* (x ref , ^ ref , A ref ) < } (46) 

\ A - A ref j 

is bounded. We choose (x ref , 7r ref , A ref ) by letting all its components to be zero except for the r- and 7- 
components which we define as follows: 7^ = (3 q ^ and r ? ref = r g max for all k = 1, . . . , N and q = 1, . . . , Q. 
Invoking the feasibility conditions (|25[) and using the above definitions, we have: for all k = 1, . . . , N and 
q = l,...,Q, 

- re f A a g,k\0 (%,k ~ Pq,k) ~ Og,fc|l ~ Vq,k\0 ) (% ~ T™ ) 
Iqjk ~ lq,k = > 0. (47) 

Let (x, 7r, A) G £<, with the tuple (x ref , 7r ref , A ref ) defined above. Clearly, the players' variables x = 
(t, p, 7) are bounded: for every q = 1, . . . , Q, we have that (r q , p q , 7 ) G X q implies 

jmax - ^ jmax n - _ - „max „„ j o ^ - ^ a qM + ( ^q,k\l ~ f^k\0 ) T q a -max yji, 

r Q <r q <T , < p < p , and fj q k < 7 g . fc < 7 k , Vfc. 

<Tq,k\0 



Next, we show that the multipliers A and the price ir are also bounded. Multiplying out [(x — x ref ) T , 
(vr - vr ref ) T , (A - A ref ) T ] * (x ref , vr ref , A ref ), using the expression of * (x ref , 7r rei , A ref ), canceling some 
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terms, and collecting the remaining terms, we deduce 



Q — /~ _ -ret \ N Q N / 

o> 2 Y ? q ( ! T Tq ] E ( 1 - ea,») ) ^ (p) + EE 1 

<j=l '' 9 9 fe=l g=l fc=l V 



J^T j ( 7g,fc - Aj.fc ) Q '(%,*) r g,fc (P) 



VV (l 9 " \ Q—^hD \H qq {k)\ 2 Pq {k) y-f - (A , ) 



W q , k -p q {k) Q'(%k) 



\ ^ \ j max i _ t i 

9=1 



(49) 



Using the fact that the third and the fourth term on the right hand side of (|49p are nonpositive [the 
nonpositivity of the fourth term comes from ()47|) and Q'^Jg k) 0], we have 



Q Q N 

E A 9 7 g max + 7T ■ 7 max < EE 

9=1 g=lfc=l 



2r q (? q ^-? q ) \ 



(l-Q(%,k))+\ 1 



f q T q 



Q N 

^EE 

q=l fc=l 



o C max /Jmax _ ^ 1 
z 'g ^ 'g 'g 



+ 1- 



/2vr 



loe 1 



7g,fc ~ A?,fc J \Q'(%k)\ 



\H qq (k)\ 2 P ^ 



r q ,k ( 



cr 



q,k 



where the last inequality follows from the bounds (|48|) and Q (^f q k) !_ 1/V2n, which establishes the 
boundedness of the elements in the set C<. 

The last part of the theorem follows from Proposition [5] and the easy check that a trivial QNE cannot 
satisfy ([32]). □ 
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